1,610 research outputs found

    Detecting Sarcasm in Multimodal Social Platforms

    Full text link
    Sarcasm is a peculiar form of sentiment expression, where the surface sentiment differs from the implied sentiment. The detection of sarcasm in social media platforms has been applied in the past mainly to textual utterances where lexical indicators (such as interjections and intensifiers), linguistic markers, and contextual information (such as user profiles, or past conversations) were used to detect the sarcastic tone. However, modern social media platforms allow to create multimodal messages where audiovisual content is integrated with the text, making the analysis of a mode in isolation partial. In our work, we first study the relationship between the textual and visual aspects in multimodal posts from three major social media platforms, i.e., Instagram, Tumblr and Twitter, and we run a crowdsourcing task to quantify the extent to which images are perceived as necessary by human annotators. Moreover, we propose two different computational frameworks to detect sarcasm that integrate the textual and visual modalities. The first approach exploits visual semantics trained on an external dataset, and concatenates the semantics features with state-of-the-art textual features. The second method adapts a visual neural network initialized with parameters trained on ImageNet to multimodal sarcastic posts. Results show the positive effect of combining modalities for the detection of sarcasm across platforms and methods.Comment: 10 pages, 3 figures, final version published in the Proceedings of ACM Multimedia 201

    String Indexing for Patterns with Wildcards

    Get PDF
    We consider the problem of indexing a string tt of length nn to report the occurrences of a query pattern pp containing mm characters and jj wildcards. Let occocc be the number of occurrences of pp in tt, and σ\sigma the size of the alphabet. We obtain the following results. - A linear space index with query time O(m+σjloglogn+occ)O(m+\sigma^j \log \log n + occ). This significantly improves the previously best known linear space index by Lam et al. [ISAAC 2007], which requires query time Θ(jn)\Theta(jn) in the worst case. - An index with query time O(m+j+occ)O(m+j+occ) using space O(σk2nlogklogn)O(\sigma^{k^2} n \log^k \log n), where kk is the maximum number of wildcards allowed in the pattern. This is the first non-trivial bound with this query time. - A time-space trade-off, generalizing the index by Cole et al. [STOC 2004]. We also show that these indexes can be generalized to allow variable length gaps in the pattern. Our results are obtained using a novel combination of well-known and new techniques, which could be of independent interest

    Competition and Selection Among Conventions

    Full text link
    In many domains, a latent competition among different conventions determines which one will come to dominate. One sees such effects in the success of community jargon, of competing frames in political rhetoric, or of terminology in technical contexts. These effects have become widespread in the online domain, where the data offers the potential to study competition among conventions at a fine-grained level. In analyzing the dynamics of conventions over time, however, even with detailed on-line data, one encounters two significant challenges. First, as conventions evolve, the underlying substance of their meaning tends to change as well; and such substantive changes confound investigations of social effects. Second, the selection of a convention takes place through the complex interactions of individuals within a community, and contention between the users of competing conventions plays a key role in the convention's evolution. Any analysis must take place in the presence of these two issues. In this work we study a setting in which we can cleanly track the competition among conventions. Our analysis is based on the spread of low-level authoring conventions in the eprint arXiv over 24 years: by tracking the spread of macros and other author-defined conventions, we are able to study conventions that vary even as the underlying meaning remains constant. We find that the interaction among co-authors over time plays a crucial role in the selection of them; the distinction between more and less experienced members of the community, and the distinction between conventions with visible versus invisible effects, are both central to the underlying processes. Through our analysis we make predictions at the population level about the ultimate success of different synonymous conventions over time--and at the individual level about the outcome of "fights" between people over convention choices.Comment: To appear in Proceedings of WWW 2017, data at https://github.com/CornellNLP/Macro

    Comparison of Spectra in Unsequenced Species

    Get PDF
    International audienceWe introduce a new algorithm for the mass spectromet- ric identication of proteins. Experimental spectra obtained by tandem MS/MS are directly compared to theoretical spectra generated from pro- teins of evolutionarily closely related organisms. This work is motivated by the need of a method that allows the identication of proteins of unsequenced species against a database containing proteins of related organisms. The idea is that matching spectra of unknown peptides to very similar MS/MS spectra generated from this database of annotated proteins can lead to annotate unknown proteins. This process is similar to ortholog annotation in protein sequence databases. The difficulty with such an approach is that two similar peptides, even with just one mod- ication (i.e. insertion, deletion or substitution of one or several amino acid(s)) between them, usually generate very dissimilar spectra. In this paper, we present a new dynamic programming based algorithm: Packet- SpectralAlignment. Our algorithm is tolerant to modications and fully exploits two important properties that are usually not considered: the notion of inner symmetry, a relation linking pairs of spectrum peaks, and the notion of packet inside each spectrum to keep related peaks together. Our algorithm, PacketSpectralAlignment is then compared to SpectralAlignment [1] on a dataset of simulated spectra. Our tests show that PacketSpectralAlignment behaves better, in terms of results and execution tim

    Delineation of Cohen Syndrome Following a Large-Scale Genotype-Phenotype Screen

    Get PDF
    Cohen syndrome is an autosomal recessive condition associated with developmental delay, facial dysmorphism, pigmentary retinopathy, and neutropenia. The pleiotropic phenotype, combined with insufficient clinical data, often leads to an erroneous diagnosis and has led to confusion in the literature. Here, we report the results of a comprehensive genotype-phenotype study on the largest cohort of patients with Cohen syndrome assembled to date. We found 22 different COH1 mutations, of which 19 are novel, in probands identified by our diagnostic criteria. In addition, we identified another three novel mutations in patients with incomplete clinical data. By contrast, no COH1 mutations were found in patients with a provisional diagnosis of Cohen syndrome who did not fulfill the diagnostic criteria (“Cohen-like” syndrome). This study provides a molecular confirmation of the clinical phenotype associated with Cohen syndrome and provides a basis for laboratory screening that will be valuable in its diagnosis

    Bose-Einstein Correlations in e+e- to W+W- at 172 and 183 GeV

    Get PDF
    Bose-Einstein correlations between like-charge pions are studied in hadronic final states produced by e+e- annihilations at center-of-mass energies of 172 and 183 GeV. Three event samples are studied, each dominated by one of the processes W+W- to qqlnu, W+W- to qqqq, or (Z/g)* to qq. After demonstrating the existence of Bose-Einstein correlations in W decays, an attempt is made to determine Bose-Einstein correlations for pions originating from the same W boson and from different W bosons, as well as for pions from (Z/g)* to qq events. The following results are obtained for the individual chaoticity parameters lambda assuming a common source radius R: lambda_same = 0.63 +- 0.19 +- 0.14, lambda_diff = 0.22 +- 0.53 +- 0.14, lambda_Z = 0.47 +- 0.11 +- 0.08, R = 0.92 +- 0.09 +- 0.09. In each case, the first error is statistical and the second is systematic. At the current level of statistical precision it is not established whether Bose-Einstein correlations, between pions from different W bosons exist or not.Comment: 24 pages, LaTeX, including 6 eps figures, submitted to European Physical Journal

    W+W- production and triple gauge boson couplings at LEP energies up to 183 GeV

    Get PDF
    A study of W-pair production in e+e- annihilations at Lep2 is presented, based on 877 W+W- candidates corresponding to an integrated luminosity of 57 pb-1 at sqrt(s) = 183 GeV. Assuming that the angular distributions of the W-pair production and decay, as well as their branching fractions, are described by the Standard Model, the W-pair production cross-section is measured to be 15.43 +- 0.61 (stat.) +- 0.26 (syst.) pb. Assuming lepton universality and combining with our results from lower centre-of-mass energies, the W branching fraction to hadrons is determined to be 67.9 +- 1.2 (stat.) +- 0.5 (syst.)%. The number of W-pair candidates and the angular distributions for each final state (qqlnu,qqqq,lnulnu) are used to determine the triple gauge boson couplings. After combining these values with our results from lower centre-of-mass energies we obtain D(kappa_g)=0.11+0.52-0.37, D(g^z_1)=0.01+0.13-0.12 and lambda=-0.10+0.13-0.12, where the errors include both statistical and systematic uncertainties and each coupling is determined setting the other two couplings to the Standard Model value. The fraction of W bosons produced with a longitudinal polarisation is measured to be 0.242+-0.091(stat.)+-0.023(syst.). All these measurements are consistent with the Standard Model expectations.Comment: 48 pages, LaTeX, including 13 eps or ps figures, submitted to European Physical Journal

    Search for Neutral Higgs Bosons in e+e- Collisions at sqrt(s) ~189GeV

    Full text link
    A search for neutral Higgs bosons has been performed with the OPAL detector at LEP, using approximately 170 pb-1 of e+e- collision data collected at sqrt(s)~189GeV. Searches have been performed for the Standard Model (SM) process e+e- to H0Z0 and the MSSM processes e+e- to H0Z0, A0h0. The searches are sensitive to the b b-bar and tau antitau decay modes of the Higgs bosons, and also to the MSSM decay mode h0 to A0A0. OPAL search results at lower centre-of-mass energies have been incorporated in the limits we set, which are valid at the 95% confidence level. For the SM Higgs boson, we obtain a lower mass bound of 91.0 GeV. In the MSSM, our limits are mh>74.8GeV and mA>76.5GeV, assuming tan(beta)>1, that the mixing of the scalar top quarks is either zero or maximal, and that the soft SUSY-breaking masses are 1 TeV. For the case of zero scalar top mixing, we exclude values of tan(beta) between 0.72 and 2.19.Comment: 38 pages, 15 figures, submitted Euro. Phys. J.

    Measurements of Flavour Dependent Fragmentation Functions in Z^0 -> qq(bar) Events

    Get PDF
    Fragmentation functions for charged particles in Z -> qq(bar) events have been measured for bottom (b), charm (c) and light (uds) quarks as well as for all flavours together. The results are based on data recorded between 1990 and 1995 using the OPAL detector at LEP. Event samples with different flavour compositions were formed using reconstructed D* mesons and secondary vertices. The \xi_p = ln(1/x_E) distributions and the position of their maxima \xi_max are also presented separately for uds, c and b quark events. The fragmentation function for b quarks is significantly softer than for uds quarks.Comment: 29 pages, LaTeX, 5 eps figures (and colour figs) included, submitted to Eur. Phys. J.
    corecore